Ensinador: corpus-based Portuguese grammar exercises
نویسندگان
چکیده
In this paper we describe Ensinador, a tool that creates grammar exercises for teaching Portuguese as a foreign language, at university level, based on large annotated corpora. After discussing the motivation for the tool, and the context in which it was deployed, we present its specification and its implementation, discussing the particular requirements with concrete examples. We report actual use of the tool in teaching, and compare it to other systems in the literature, ending the paper with a short description of future and ongoing related work.
منابع مشابه
CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus
This paper describes an ongoing Portuguese Language grammar checker project, called CoGrOO1-Corretor Gramatical para OpenOffice (Grammar Checker for OpenOffice), based on CETENFOLHA, a Brazilian Portuguese morphosyntactic annotated Corpus. Two of its features are highlighted: hybrid architecture, mixing rules and statistics; free software project. This project aims at checking grammatical error...
متن کاملBaseline Acoustic Models for Brazilian Portuguese Using CMU Sphinx Tools
Advances in speech processing research rely on the availability of public resources such as corpora, statistical models and baseline systems. In contrast to languages such as English, there are few specific resources for Brazilian Portuguese. This work describes efforts aiming to decrease such gap. Baseline acoustic models for Brazilian Portuguese were built using the CMU Sphinx toolkit and pub...
متن کاملUniversal Dependencies for Portuguese
This paper describes the creation of a Portuguese corpus following the guidelines of the Universal Dependencies Framework. Instead of starting from scratch, we invested in a conversion process from the existing Portuguese corpus, called Bosque. The conversion was done by applying a context-sensitive set of Constraint Grammar rules to its original deep linguistic analysis, which was carried out ...
متن کاملAnnotating COMPARA, a Grammar-aware Parallel Corpus
In this paper we describe the annotation of COMPARA, currently the largest post-edited parallel corpora which includes Portuguese. We describe the motivation, the results so far, and the way the corpus is being annotated. We also provide the first grounded results about syntactical ambiguity in Portuguese. Finally, we discuss some interesting problems in this connection.
متن کاملPhonologic Patterns of Brazilian Portuguese: a grapheme to phoneme converter based study
This paper presents Brazilian Portuguese phoneme patterns of distribution, according to an automatic grammar rulesbased grapheme to phoneme converter. The software Nhenhém (Vasilévski, 2008) was used for treating data: written texts which were decoded into phonologic symbols, forming a corpus, and subjected to a statistical analysis. Results support the high level of predictability of Brazilian...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Procesamiento del Lenguaje Natural
دوره 47 شماره
صفحات -
تاریخ انتشار 2011